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Abstract—In Wearable Body Sensor Networks (WBSWNs), the 
multi-sensor fusion strategy is widely utilized in Human Activity 
Recognition (HAR) problems. As the classical decision-level 
fusion strategy, the Belief Functions (BF) theory often uses a 
flat structure to fuse the outputs of basic classifiers. However, it 
is very challenge and difficult to accurately identify confusing 
activities such as walking upstairs and downstairs based on 
such flat structure. In this paper, a novel Hierarchical Belief K- 
Nearest Neighbors approach (Hie-BKNN) for activity recognition 
is proposed. First, the data-driven hierarchical tree is constructed 
by using the spectral clustering strategy. Then, inspired by BF 
theory and classical K-nearest neighbors, the pair-aware belief 
masses are calculated for each node and the belief hierarchical 
tree can be generated. Afterwards, we adopt a novel hierarchical 
fusion strategy to fuse all involved belief hierarchical trees. 
Finally, by selecting the probability maximization path on the 
entire tree, the label of a test sample can be effectively predicted. 
The experimental results show that our proposed Hie-BKNN 
approach can precisely recognize daily activities of two different 
UCI public data sets, i.e., Smartphone and WISDM datasets. 


Index Terms—Miulti-Sensor Fusion, Activity Recognition, Hier- 
archical Classification, Belief Functions Theory, Wearable Body 
Sensor Networks. 


I. INTRODUCTION 
A. Background and Research Motivation 


N recent years, with the development of Internet of Things 

(loT) technology, Human Activity Recognition (HAR) in 
Body Sensor Networks (BSNs) is widely used in monitoring 
environmental assisted living [1], sports injury detection [2], 
health management [3], medical diagnosis [4], and elderly 
care [5]. Generally, traditional activity recognition models 
[6] usually rely on machine learning models to identify ac- 
tivities by using the signals sensed from a single wearable 
sensor. However, many recent works [7-10] have been al- 
ready proved that HAR model coupled with multi-sensor data 
fusion techniques is an effective solution for high-precision 
activity recognition from imperfect sensor samples such as 
noise signals, data loss or inconsistency. As the classical 
decision-level fusion strategy, Belief Functions (BF) theory 
[11], also called Dempster-Shafer evidence theory (DST), is 
an important reasoning framework for uncertainty modeling, 
which has obvious mathematical advantages in fusion of 
imperfect information [12]. Thus, BF theory has been widely 
used in HAR fields because of its ability in describing and 
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eliminating the uncertainty in the activity recognition prob- 
lems. For example, in our previous works [7], the selected 
kernel density estimation models for each specific activity 
were first built and combined with Dezert-Smarandache theory 
(DSmT) in the decision-level fusion stage to predict the label 
of unknown activities. Following this line, we further extended 
the type of classical belief structure in the theory of BF 
and proposed the hesitant fuzzy belief structure to deal with 
HAR problems in [13]. Besides, Noor et.al [14] integrated the 
ontological reasoning mechanism with DST to provide support 
for handling uncertainty in the activity recognition. Although 
the performance of the BF-based activity recognition models 
has been improved, it is often difficult to recognize those 
fine-grained and confusing activities, such as going up and 
down stairs, opening and closing doors, and so on. This is 
mainly because that the classical flat structure is often applied 
in the fusion stages of BF theory which could easily neglect 
the inter-class taxonomic relationships. Fortunately, there often 
exists the potential hierarchy structure in daily activities, which 
helps to efficiently organize raw data with various categories. 
Because of such hierarchical structure between activities, the 
more difficult and confusing activity classification tasks can be 
divided into several simpler subtasks, so they can be solved 
more effectively. In the literature [15] and [16], authors fully 
compared and discussed the advantages and disadvantages 
of these two strategies: hierarchical classification and flat 
classification, and then gave their conclusion that hierarchical 
classification is more robust in classification tasks than flat 
classification. 


B. Challenges 


Although under the framework of BF theory, the activity 
recognition model based on the hierarchical structure has a 
predictable classification effect, it also faces many challenges: 


e Construction of hierarchical structure: for the BF- 
based hierarchical activity recognition, the key problem is 
to construct a good hierarchical label structure. However, 
in real scenarios, the prior knowledge of how to organize 
data hierarchically is usually limited. And we usually only 
know some rough relationships between categories which 
are not enough for constructing a complete hierarchical 
structure; 

e Calculation of belief alignment: In the theory of BF, 
the belief assignments need to be calculated before the 
process of decision-level fusion. Thus, how to construct 
the belief masses of all involved categories reasonably 
and accurately in the hierarchical structure is particularly 
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important. However, such problem is still one of the 
difficulties in the application of BF theory [17]; in ad- 
dition, the traditional combination rules in BF theory are 
only applicable to combine Sources of Evidence (SoEs) 
with the flat structure, it is very necessary to propose 
new fusion rules for fusing SoEs with the hierarchical 
structure; 

e Implementation of hierarchical reasoning: performing 
hierarchical reasoning based on a hierarchical structure 
is also a challenge [18]. That is to say, in the process of 
hierarchical reasoning, it is very important to study how 
to prevent the errors propagation from the root node to 
the leaf node. 


C. Main Contributions 


In this paper, we propose a novel Hierarchical Belief K- 
Nearest Neighbors classifier (Hie-BKNN) for human activity 
recognition. The main contributions of this work are summa- 
rized as follows: 

e We propose a data-driven hierarchical structure learning 
method. First, Dynamic Time Warping (DTW) is used 
as the similarity metric to compute the similarity degree 
between activities. Then, the spectral clustering is recur- 
sively applied to construct the hierarchical tree; 

e Through the nearest neighbors search, the belief mass 
of the unlabeled sample for each node is calculated 
to generate the belief hierarchical tree. Then, a new 
hierarchical fusion rule is also proposed to combine all 
involved belief hierarchical trees; 

e Finally, according to the interpretable maximum prob- 
ability decision rule, we can classify unlabeled sam- 
ples following the top-down optimal path in the fused 
belief hierarchical tree. Comprehensive experiments on 
two public activity recognition datasets demonstrate the 
superiority of the proposed Hie-BKNN. 

The rest of this article is organized as follows. Section 
II reviews the basics of BF theory. Section III presents the 
proposed Hie-BKNN. Section IV reports the experimental 
results. Finally, Section V concludes this article. 


II. BASICS OF BELIEF FUNCTION THEORY 


In DST framework, the Frame of Discernment (FoD)! 
© = {61,...,On} (n > 2) is a set of exhaustive and exclusive 
elements (hypotheses) which represent the possible solutions 
of the problem under consideration. Shafer’s model assumes 
ON0' = 0 for 0 4 @’ in O. A BBA m(-) is defined by the map- 
ping: 2° ++ [0,1], verifying m(@) = 0 and 7p.46 m(9) = 
1. The belief (or credibility) and plausibility functions are 
respectively defined by Bel(#) + Dore2e}97c9 (8) and 
P10) + e'e2° |0'n0 40 m(6'). BI(0) & [Bel(0), P1(@)] is 
called the belief interval of @. 

In order to combine two distinct SoEs, the classical DS 
rule in [11]) was proposed and defined by mpgs(@) = 0 and 
VO € 2® \ {0}: 


0",07€2° 9/9" =0 m4 (8)m2(0") 
1- 26",0€29 19/6" =0 my, (8")m2(0") : 


'Here, we use the symbol £ to mean equals by definition. 


(1) 


mps(6) = 


Il]. HIERARCHICAL BELIEF K-NEAREST NEIGHBORS 
FOR HAR 


In this section, we first introduce the principle of hierar- 
chical tree construction, then describe the specific steps of 
Hie-BKNN classifier in details. 


A. Hierarchical Tree Construction 


As shown in previous works [18], it is reasonable to classify 
highly similar sequential classes into a super class. Generally, 
two main necessary steps are included: 1) affinity matrix is 
computed which aims to measure the inter-class similarity, 
and 2) the spectral clustering is utilized for hierarchical tree 
construction. 


Algorithm 1: DTW algorithm 


Input: X = {x1,22,--- 
time series. 
Output: The optimal warping path; The distance of X and X’. 
1 Initialize: DTW (1,1) = di,1 = \/ (21 — 24)”: 
2 for 7 = 2,--- ,Vi do 
3 for 7 = 2,--- , V2 do 


,vy,} and X’ = {x/,25,--- thy, }: two 


5 

6 Dz = di,j t DTW(i 1,7 lt) 
7 D3 = di,j + DTW (i, 9 _ 1); 
8 

9 


DTW = min(D,, Dao, D3); 
The optimal 
path, ;=min_index((i 


1,9), @—1,3—1), G7 —1)). 


10 end 


1) Calculating Affinity Matrix: In order to obtain an affinity 
matrix, we first need to select a robust time-series distance 
metric. Until now, it is still an unsolved issue to choose 
the robust distance metric to measure the similarity degree 
between daily activity-based time series data [19]. In general, 
the Euclidean distance [20] is widely used to calculate the 
sum of the distances between corresponding points among 
the time series. However, Euclidean distance is difficult to 
accurately measure distorted and shifting time-series points. 
In this article, we utilize DTW [21] to effectively measure 
the difference between time series. And the pseudo-code of 
the DTW is presented in Algorithm 1. Then, we can get the 
element of affinity matrix, which is calculated as follows: 


DTW (Xw,Xi,) 
Ay! 5! = exp . (2) 


) 


where 6 is the pre-defined parameter and 7’, 7’ € [1,c]. By 
using DTW, we can construct the affinity matrix, which is 
mathematically denoted as 


ai1 Ale 
A=| oo: (3) 


Gel Gee 


Then the spectral clustering algorithm [22, 23] is utilized to 
generate the hierarchical tree. 
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2) Hierarchical Tree Construction: Here, the construction 
of the hierarchical tree adopts a top-down construction strat- 
egy. In the process of the actual construction, the tree nodes 
contained in each layer of the hierarchical tree will be recur- 
sively divided. First, considering all the activities categories 
contained in the root node, the spectral clustering is then 
performed on the entire affinity matrix for the root node. 
In this paper, we just focus our discussions on hierarchical 
binary trees. Thus, all non-leaf nodes in the hierarchical tree 
are clustered into two groups (left child node and right child 
node). At the same time, considering that each child node may 
contains some categories (the number of categories is greater 
than 1), then the spectral clustering will be also conducted 
on the affinity sub-matrix corresponding to the categories 
contained in the child node. By analogy, all nodes are clustered 
through recursive spectral clustering until the following rule is 
met: the current node is a leaf node, that is, this node contains 
only one category. 

Note that the set of classes contained in the node v is 
denoted by C(v). The ith children node of the node v contains 
the class set Ct. For each node v, the union of the class 
sets contained in the node v, that is, U;-c,) C= Ua, 
Moreover, any pairwise sibling nodes do not overlap, which 
satisfies C'C} = 0,i 4 j. Thanks to the construction of such 
hierarchical tree, the elements and their combinations in the 
target class set can effectively correspond to the nodes in the 
derived tree structure: the root node represents the completely 
unknown class (in BF theory, this concepts is mathematically 
denoted by ©); those middle nodes corresponds to the coarse- 
grained category which represents the union of several differ- 
ent classes (these nodes can be denoted by disjunctive focal 
elements in BF theory) and the leaf nodes can be regarded as 
the most fine-grained classes (singletons in BF theory). 


B. Hierarchical Belief K-Nearest Neighbors Classifier 


In this part, we describe the main idea of our proposed Hie- 
BKNN method based on the derived hierarchical tree-based 
structure. There are two points need to mention: 


e Firstly, we do not use the classical Euclidean distance 
to measure the distances between time series. Instead, 
the nearest neighbors are selected based on the distances 
calculated by DTW; 

e Secondly, once all nearest neighbors of the testing sample 
for each node in hierarchical tree are chosen, these 
selected samples will be used to calculate the belief 
masses and then determine the label of the unknown 
testing sample after hierarchical fusion; 

e Finally, the best decision-making path in hierarchical tree 
is determined by the maximum value of belief masses. 


1) Calculating the local belief masses by using the nearest 
neighbors: Let {(X;, Y;)|i = 1,---,N} be a collection of 
N training pairs, in which X; = [x1,--- ,xy]? is the ith 
training sample with V features and Y; € {yi,--- , yc} is 
the corresponding class label. Considering that in the derived 
hierarchical tree, all involved training samples have already 
clustered into Q groups, the & nearest neighbors of the testing 
sample should also be selected in these Q groups, respectively. 


3 


Given a query instance X*, its class membership can be 
determined through the following steps: 


e For each X‘, finds k nearest neighbors of X‘ from 
{(XP, Y,7)|’ = 1,--- , N’} in v node using DTW and 
then puts them into K N N_X¢ set. And the set of classes 
contained in this node v is denoted by C(v) = Y,”, which 
is the subset of the original class label {y1, yo,--- , yc}; 

e Each nearest neighbor of X‘ in K NN_X¢ is considered 
as an item of evidence that supports certain hypotheses 
regarding the class membership of X‘. Let X/ be one of 
its k nearest neighbors with class label Y,” = yg € Y;,’. 
The mass function induced by X/, which supports the 
assertion that X‘ also belongs to yg is 


(4) 


where d;,;, is the distance between X// and X © calculated 
by DTW, while a and ¥ are pre-defined parameters and 
in this paper, we just set the values of a and y equal to 
1 and 0.05. 

e Then, we can obtain the belief tree-based structure for 
the testing sample X* based on its corresponding nearest 
neighbors. The principle of construction is quite simple: 
through the nearest neighbor samples in each node v 
of hierarchical tree, the belief assignment is calculated 
based on (4) and then this value is regarded as the weight 
linking the corresponding node in the tree structure. 


min ({Yq}) = aexp(—y- d?,). 


Based on the above steps, we can obtain the k belief 
hierarchical trees, which is determined by the & selected 
nearest neighbors. In order to improve the final classification 
accuracy, we need to fuse the all belief hierarchical trees. 
However, the traditional fusion rules in BF theory can not be 
directly used for fusing SoEs under such hierarchical structure. 
Therefore, we need to propose the new hierarchical fusion rule. 

2) Combining all neighbors’ knowledge with hierarchical 
fusion strategy: As we discussed before, we model the multi- 
granularity predefined activities according to the concept of 
FEs in BFs theory. That is to say, for the leaf nodes in tree, 
the most fine-grained activities are modeled by singletons 
and the related internal nodes are described by disjunctive 
FEs. Through the belief KNN in each layer of tree-based 
hierarchical structure, the connection weights of each node 
can be calculated and these connection weights are regarded 
as the belief masses of FEs in tree structure. 

In order to reduce the risk of error propagation caused by 
nearest neighbor noise, we propose a novel hierarchical fusion 
strategy to fuse tree-based belief mass function based on the 
selected nearest neighbors. Specifically, the two-dimensional 
BBAs in the same tree-layer are first fused with Dempster’s 
rule (1) and then the derived fusion BBAs are regarded as the 
connected weights of the corresponding nodes. Then, the final 
belief mass of each leaf node is successively multiplied by 
the connection weights from the root node to the leaf node. 
Finally, we can predict the type of the unknown activity based 
on the principle of probability maximization. 

The principle of our proposed tree-based combination rule 
for nearest neighbors is illustrated in Fig.1. As we can observe 
in Fig.l, k belief based hierarhical trees are derived from 
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Fused Belief Hierarchical Tree 


Fig. 1: Belief Hierarchical Tree Combination Rule. 


nearest neighbors trained by belief KNN and our task here is to 
combine all these involved belief hierarchical trees. In order to 
briefly show the principle of our fusion strategy for combining 
belief hierarchical trees, we set the value of k equals to 2 
and three BBAs from the belief hierarchical tree based on Ist 
nearest neighbor are given for instance: 


my, = {m11(43 U 92), m11(A1 U @4)}; (5) 
my2 = {1m712(43), m12(62)}; (6) 
my 3 = {7713(41),m13(64)}. (7) 


Similarly, for the belief hierarchical tree based on 2st nearest 
neighbor, three involved BBAs are also included: 


m1 = {7m21(43 U 82), m21 (41 U O4)}; (8) 

M2 = {M22(93), m22(F2)}; (9) 

M23 = {M23(1), M23 (4) }. (10) 

Before the fusion step, we need to ensure that each piece of 
BBAs meets the normalization condition. Then, in our fused 


belief hierarchical tree, the corresponding BBAs in the same 
layer are combined by the classical DS(-) (1): 


my, = DS(my11,mMa1) = {m1 (43 U O2), M21 (41 U A4)}; 


(11) 
myo = DS(mj2,m22) = {mF2(93), MF2(2)}; (12) 
my3 = DS(mj43, m23) = {m3(61), mf3 (G4) }. (13) 


Once the learned hierarchical tree is obtained, the label of 
the test sample needs to be predicted. And the principle of 
decision making here is very simple and the selection of the 
best path (from the root node to the leaf node) is completely 
based on the maximum value of the corresponding belief mass. 


IV. EXPERIMENTS AND DISCUSSIONS 


In this part, two famous UCI datasets: Smartphone?[24] 
and WISDM dataset*[25] are applied to verify the proposed 
Hie-BKNN model. More descriptions of these two involved 
datasets Smartphone and WISDM can be found in [24] and 
[25]. 

1) Results on UCI Smartphone Dataset: 


*http://archive.ics.uci.edu/ml/datasets/Smartphone-Based+Recognition+of 
+Human+dActivities+and+Postural+Transitions. 


3http://archive.ics.uci.edu/ml/datasets/WISDM+Smartphone+and+Smartwatch 


+Activity+and+Biometrics+Dataset+. 


a) Hierarchical tree construction: According to the al- 
gorithm described in Section III-A, the hierarchical tree of six 
daily activities involved in UCI Smartphone are constructed 
by using DTW distance and spectral clustering strategy. The 
specific steps are as follows: first, the original Smartphone 
dataset is divided into training (70%) and testing (30%) 
sets. And then, we further divide the obtained training sets 
into six sub-training sets based on the six daily activities 
(Sitting, Standing, Laying, Walking, Walkingyy, Walkingan 
which are denoted by 01, 62,03, 04, 05 and 06, respectively). 
Afterwards, the classical DTW measure is applied to calculate 
the similarity degree between daily activities and the difference 
matrix is obtained in Table I. It can be easily observed that 
there exists two potential clusters: the static activities (Sitting, 
Standing and Laying-red mark in Table I) and the dynamic 
activities (Walking, Walkingy,, Walkingan-green mark in 
Table I). It can be found that the differences between static and 
dynamic classes are large, but the differences within classes 
(static or dynamic) are relatively small. After obtaining the 
difference matrix, we use (2) to convert the difference matrix 
into the affinity matrix, which is given in Table I and the 
parameter 6 is set to 1000. As can be seen in Table II, the 
smaller the difference degree between activities, the higher the 
value in affinity matrix. Based on the derived affinity matrix, 
the spectral clustering strategy is used to cluster six groups 
of activities iteratively. Here, in our following experiments, 
we directly apply the built-in spectral clustering function in 
MATLAB software to realize clustering (the parameter settings 
of this function are as follows): 


Spectralcluster(Af finityMatrix, Numberof Cluster, 
‘Distance’ | precomputed' / LaplacianN ormalization’ , 


/ scat 
symmetric). 


Considering that the hierarchical tree structure ob- 
tained by spectral clustering is affected by the parameter 
Numberof Cluster, and we need to make the final leaf node 
in the tree structure contain only one activity class. Thus, the 
value of NumberofCluster is set to six so as to guarantee 
the number of classes in C(leafnode) equals to one. 


TABLE I: Difference Matrix Between Six Activities on UCI 
Smartphone With DTW. 


Sitting Standing Laying Walking Walk,» Walkan 
Sitting 0 phy ee) 758.5 1140.2 1065.9 1250.9 
Standing ba? 0 802.4 1134 1064.2 1240.8 
Laying 758.5 802.4 0 1256.3 1187.8 1355.9 
Walking 1140.2 1134 1256.3 0 639.5 820.5 
Walkup 1065.9 1064.2 1187.8 639.5 0 817.5 
Walkan 1250.9 1240.8 1355.9 820.5 817.5 0 


TABLE II: Affinity Matrix Between Six Activities on UCI 


Smartphone. 
Sitting Standing Laying =Walking = Walkup Walkan 
Sitting 1 0.4690 0.4683 0.3197 0.3432 0.2863 
Standing 0.4690 1 0.4482 0.3218 0.3450 0.2892 
Laying 0.4683 0.4482 1 0.2847 0.3049 0.2577 
Walking 0.3197 0.3218 0.2847 1 0.5275 0.4402 
Walkup 0.3432 0.3450 0.3049 0.5275 1 0.4415 
Walkan 0.2863 0.2892 0.2577 0.4402 0.4415 1 
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b) Belief hierarchical tree construction: Based on the 
hierarchical tree structure derived from spectral clustering, we 
use belief k-NN described in III-B to construct the belief hier- 
archical tree: first, the value of the nearest neighbor parameter 
k is set to 7. It means that, for the testing sample, seven nearest 
training time series are chosen based on DTW measure to 
calculate the belief masses through (4). For example, in order 
to illustrate the principle of hierarchical fusion, two obtained 
belief hierarchical trees are shown in Fig.2. 


Belief Hierarchical Tree-1 


Belief Hierarchical Tree-2 


Fig. 2: Two Belief Hierarchical Trees in UCI Smartphone 
Dataset. 


c) Hierarchical Fusion: Then, these two belief hi- 
erarchical tree can be combined by using our proposed 
hierarchical fusion rule. First, there exists five BBAs 
(m41,M42,M43,M14, M45) in belief hierarchical tree-1: 


my11(@1 U 62 U 03) = 0.9891, m11 (04 U Os U 06) = 0.0109; 
(14) 
my42(@1 U 63) = 0.5769, m12(82) = 0.4231; (15) 
m13(@4 U 65) = 0.8394, m13(@6) = 0.1706; (16) 
my4(@1) = 0.5010, m14(03) = 0.4990; (17) 
m15(@4) = 0.2987, mis (65) = 0.7013. (18) 
Similarly, there exists another five BBAs 


(M21, M22, M3, M4, M25) in belief hierarchical tree-2: 


mai (01 U 2 U 03) = 0.9893, m1 (04 U 65 U 8g) = 0.0107; 
(19) 
m22(61 U 63) = 0.7183, m22(82) = 0.2817; (20) 
mo3(04 U 05) = 0.5503, m23(0g) = 0.4497; (21) 
m24(01) = 0.7094, m24(03) = 0.2906; (22) 
m5 (04) = 0.2673, mo5(65) = 0.7327. (23) 


Then, we can combine these BBAs in the corresponding 
layer by using DS(-) and get the final fused five BBAs: 


my; = DS(m}1, m1) = (0.9999, 0.0001); (24) 
mp2 = DS(my2, M22) = (0.7766, 0.2234); (25) 
m3 = DS(m}3,m93) = (0.8561,0.1439); (26) 
m4 = DS(m44,m94) = (0.7102, 0.2898); (27) 
ms = DS(m}5, m5) = (0.1345,0.8655). (28) 


d) Decision making: Finally, the final best decision- 
making path in hierarchical tree can be found based on the 
principle of maximum belief assignment which is denoted in 
Fig.3 (the red line is the final decision-making path). And in 
this case the label of the testing sample belongs to 0. 


Fused Belief Hierarchical Tree 


Fig. 3: The decision-making path in Belief Hierarchical 
Trees. 


e) Compared to several classical algorithms: As dis- 
cussed earlier, based on the UCI smartphone data set in 
the experiment, we compared the proposed method with the 
classic algorithms. The comparison results are shown in Table 
Ill. In addition, the confusion matrix of Hie-BKNN is further 
given (Fig.4). Benefiting from the hierarchical tree structure, 
the performance of our proposed Hie-BKNN is better than all 
the algorithms mentioned, and Hie-BKNN achieved the best 
performance (90.30%) in all the recognition tasks of activity 
recognition. 


TABLE III: The Performance Comparisons on Smartphone. 


Method Accuracy 
Decision Tree 87.4% 
Naive Bayes 88.9% 
K-Means 52.1% 
AdaBoost 40.8% 
Gaussian Mixture 49.8% 
Hyperbox Neural Network [27] 87.4% 
FW K-Nearest Neighbor [28] 87.8% 
FW Naive Bayes [28] 88.6% 
Ours Hie-BKNN 90.30% 


Confusion Matrix for Hie-BKNN in UCI Smartphone Dataset. 


Sitting MSZ 9.00 3.00 0.00 0.00 0.00 
450 
Standing | 35.00 BREmee 2.00 0.00 0.00 0.00 — 
350 
Laying} 36.00 54.00 exieRele} 0.00 0.00 0.00 300 


walking] 0.00 1.00 0.00 


Walk! 0.00 0.00 0.00 35.00 Zeyaele) 


Walk own 


0.00 0.00 0.00 3.00 12.00 szzxele 


Sitting Standing Laying Walking Walky,, Wally, 


Fig. 4: Confusion Matrix of Hie-BKNN for Smartphone. 


2) Results on UCI WISDM Dataset: 

a) Compared to the state-of-the-art methods: Similar to 
the experiments in the Smartphone dataset, we compare and 
analyze the performance of the proposed method and the 
classic algorithm on the WISDM dataset. The experimental 
results are shown in Table IV. It can be clearly seen that, 
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compared to all these mentioned algorithms, the Hie-BKNN 
model has the best performance (93.71%). Compared with 
the classic method mentioned, the biggest advantage of this 
method is that, in this paper, we mainly rely on the hierarchical 
modeling capabilities with our new hierarchical decision level 
fusion strategy, which help to achieve higher accuracy of 
activity recognition. 


TABLE IV: The Performance Comparisons on WISDM. 


Method Accuracy 
ANN 91.52% 
ELM 92.06% 
RF 91.77% 
Deep LSTM 91.42% 
Hyperbox Neural Network [27] 81.31% 
FW K-Nearest Neighbor [28] 91.12% 
FW Naive Bayes [28] 90.73% 
DSmT-Based Kernel Density Estimation [7] 91.50% 
Our Proposed Method (Hie-BKNN) 93.71% 


V. CONCLUSION 


In this paper, we propose a novel hierarchical belief K- 
Nearest Neighbors classifier (Hie-BKNN) for activity recog- 
nition in body sensor networks. Compared to existing works, 
Hie-BKNN first learn the hierarchical tree structure from the 
training datasets. Then, in the testing stage, the corresponding 
belief hierarchical tree is derived based on the belief K- 
Nearest Neighbors by using DTW distance. Moreover, the 
novel hierarchical fusion rule is proposed to combine all belief 
hierarchical trees to make the activity recognitions. Finally, 
we conducted extensive experiments on two publicly available 
activity classification datasets: Smartphone and WISDM. The 
results demonstrate the superiority of Hie-BKNN against other 
state-of-the-art traditional methods. 
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